Overcoming performance bottlenecks in using OpenMP on SMP clusters

نویسندگان

  • Woo-Chul Jeun
  • Yang-Suk Kee
  • Soonhoi Ha
  • Changdon Kee
چکیده

This paper presents a new parallel programming environment called ParADE to enable easy, portable, and high-performance computing for SMP clusters. Different from the prior studies, ParADE separates the programming model from the execution model: it enables shared-address-space programming while it realizes hybrid execution of message-passing and shared-address-space. To overcome the poor performance of conventional OpenMP on SDSM (Software Distributed Shared Memory), ParADE implements an intelligent OpenMP translator supporting efficient mutual exclusion and efficient page transmission. The experimental results on a Linux cluster demonstrate that ParADE reduces mutual exclusion overhead and overall execution time. 2008 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analyzing Memory Performance Bottlenecks in OpenMP Programs on SMP Architectures using ccSIM

NAGARAJAN, ANITA Analyzing Memory Performance Bottlenecks in OpenMP Programs on SMP Architectures using ccSIM. (Under the direction of Assistant Professor Frank Mueller). As computing demands increase, performance analysis of application behavior has become a widely researched topic. In order to obtain optimal application performance, an understanding of the interaction between hardware and sof...

متن کامل

Intone — Tools and Environments for OpenMP on Clusters of SMPs1

Clusters of small-scale SMP computers are becoming more and more common as high-performance computing needs have arised, not only in national scientific laboratories, but also in enterprises of various kinds. An SMP-cluster represents a sweet-spot of cost-efficiency compared to a larger SMP system or to a cluster with smaller nodes. With the emergance of OpenMP, shared memory computing has also...

متن کامل

High-performance Algorithms and Applications for Smp Clusters

The future of high-performance computing relies on the efficient and scalable use of clusters with symmetric multiprocessor (SMP) nodes and low-latency, high-bandwidth interconnection networks. Current examples of such platforms include Sun Ultra HPC machines, Compaq AlphaServers with Quadrics switches, SGI Origins, and the IBM SP system with SMP nodes. Moreover, the future of NASA missioncriti...

متن کامل

Design of OpenMP Compiler for an SMP Cluster

In this paper, we present a design of OpenMP compiler for an SMP cluster. Although clusters of SMPs are expected to be one of the cost-e ective parallel computing platforms, both of inter and intra node parallelism must be exploited to achieve high performance. These two levels of structure complicate parallel programming. The OpenMP is an emerging standard for parallel programming on shared-me...

متن کامل

Comparing the OpenMP, MPI, and Hybrid Programming Paradigm on an SMP Cluster

Clusters of SMP (Symmetric Multi-Processors) nodes provide support for a wide range of parallel programming paradigms. The shared address space within each node is suitable for OpenMP parallelization. Message passing can be employed within and across the nodes of a cluster. Multiple levels of parallelism can be achieved by combining message passing and OpenMP parallelization. Which programming ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Parallel Computing

دوره 34  شماره 

صفحات  -

تاریخ انتشار 2008